Model-based deconvolution of genome-wide DNA binding
نویسندگان
چکیده
MOTIVATION Chromatin immunoprecipitation followed by hybridization to a genomic tiling microarray (ChIP-chip) is a routinely used protocol for localizing the genomic targets of DNA-binding proteins. The resolution to which binding sites in this assay can be identified is commonly considered to be limited by two factors: (1) the resolution at which the genomic targets are tiled in the microarray and (2) the large and variable lengths of the immunoprecipitated DNA fragments. RESULTS We have developed a generative model of binding sites in ChIP-chip data and an approach, MeDiChI, for efficiently and robustly learning that model from diverse data sets. We have evaluated MeDiChI's performance using simulated data, as well as on several diverse ChIP-chip data sets collected on widely different tiling array platforms for two different organisms (Saccharomyces cerevisiae and Halobacterium salinarium NRC-1). We find that MeDiChI accurately predicts binding locations to a resolution greater than that of the probe spacing, even for overlapping peaks, and can increase the effective resolution of tiling array data by a factor of 5x or better. Moreover, the method's performance on simulated data provides insights into effectively optimizing the experimental design for increased binding site localization accuracy and efficacy. AVAILABILITY MeDiChI is available as an open-source R package, including all data, from http://baliga.systemsbiology.net/medichi.
منابع مشابه
Post-translational changes of histones, methylation level, and ERβ protein level in the cumulus cell genome of infertile women with endometriosis
Background: Endometriosis (which affects up to 50% of infertile women) is one of the major causes impacting female infertility. Endometriosis, defined as the presence of endometrial glands and stroma outside the uterine tissue, causes a wide range of functional disorders in the process of follicular development and changes in the follicular milieu, resulting in the formation of an incompetent o...
متن کاملBioinformatics Genome-Wide Characterization of the WRKY Gene Family in Sorghum bicolor
The WRKY gene family encodes a large group of transcription factors that regulate genes involved in plant response to biotic and abiotic stresses. Sorghum is a notable grain and forage crop in semi-arid regions because of its unusual tolerance against hot and dry environments. We identified a set of 85 WRKY genes in the S. bicolor genome and classified them into three groups (I–III). Among the ...
متن کاملO-11: N-a-acetyltransferase 10 Protein Regulates DNA Methylation and Embryonic Development
Background Genomic imprinting is a heritable and developmentally essential phenomenon by which gene expression occurs in an allele-specific manner1. While the imprinted alleles are primarily silenced by DNA methylation, it remains largely unknown how methylation is targeted to imprinting control region (ICR), also called differentially methylated region (DMR), and maintained. Here we show that ...
متن کاملPredicting CpG Islands and DNA Methlation in the Cow Genome Using DNA Microarray Meta-Analysis and Genome Wide Scanning
DNA methylation is a type of epigenetic changes that directly affects DNA. In mammals, DNA methylation is essential for fetal development and stem cell differentiation and this phenomenon essentially occurs within the CpG islands. In this study, two methods were used to study the DNA methylation profile of cow genome. In the first method, the DNA methylation profile of the differentially expres...
متن کاملdPeak: High Resolution Identification of Transcription Factor Binding Sites from PET and SET ChIP-Seq Data
Chromatin immunoprecipitation followed by high throughput sequencing (ChIP-Seq) has been successfully used for genome-wide profiling of transcription factor binding sites, histone modifications, and nucleosome occupancy in many model organisms and humans. Because the compact genomes of prokaryotes harbor many binding sites separated by only few base pairs, applications of ChIP-Seq in this domai...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Bioinformatics
دوره 24 3 شماره
صفحات -
تاریخ انتشار 2008